Week 5
Milestones
- Tested the interface for certificate data obtained in PDF format.
- Converted the obtained certificate data from PDF to XML format using PDF.js library.
- Tested the DLP platform for certificates from different organizations.
Contributions
Learnings
- Learnt about PDF parsing techniques using PDF.js library.
- Creation of a structured XML document with meaningful tags from the extracted PDF document.